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SELECTION PR OCEDURE USING PRODRUG/ENZYME SYSTEM 

The present invention relates to a process for the selection from a gene library of a gene 
encoding an enzyme capable of catalysing the conversion of a prodrug to its active drug form. 
Prodrug activating enzymes convert prodrugs, which are relatively non-toxic, to cytotoxic 
agents that have been shown to be effective in killing tumour cells (Connors, T.A., (1995) 
Gene Therapy 2:702-709). It has been hypothesised that selective delivery of prodrug 
activating enzymes to tumours would allow the local generation of a cytotoxic agent, through 
the systemic administration of a substrate prodrug. This would allow a greater differential 
between systemic toxicity and anti-tumour activity than can be achieved by systemic delivery 
of cytotoxic agents. 

The activating enzyme may be targeted to the tumour cells by coupling it to a tumour-targeting 
moiety such as an antibody of suitable specificity ("antibody-directed enzyme prodrug 
therapy", = ADEPT). Alternatively, a gene encoding the prodrug activating enzyme may be 
delivered to the tumour cells, either as some kind of DNA complex, or in various kinds of viral 
vectors (GDEPT or VDEPT, = gene / virus directed enzyme prodrug therapy). (Huber et al, 
1991; Huber et al, 1994;Harris et al y 1994; Searle et al, 1996). 

Similar systems may be effective in other instances requiring selective cell killing, such as, for 
example, countering restenosis of blood vessels following balloon angioplasty, selective tissue 
ablation in transgenic animals and as a fail-safe suicide system to allow killing of (genetically 
modified) immune-effector cells present in allogeneic bone-marrow transplants. 

One known prodrug activating enzyme is nitroreductase from E. coli, which can activate the 
prodrug CB1954 (5-[aziridin-l-yl]-2,4-dinitrobenzamide) by reduction at either the 2 or the 4 
position. Further, non-enzymatic reaction of this product leads to the generation of a potent bi- 
functional alkylating agent which can produce interstrand crosslinks in DNA, and which is 
therefore highly cytotoxic. Experiments using this approach have demonstrated that retroviral 
transfer of a nitroreductase expression cassette can sensitise a number of colorectal, pancreatic 
and ovarian tumour cell lines to the prodrug CB1954 (McNeish et al. 9 1998; Green et a/., 
1996). Significantly increased survival and apparent cures have also been observed following 
treatment of mice harbouring nitroreductase-expressing tumours with CB1954 (McNeish et al, 
1998). 
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However, the efficiency of this method of selective tumour cell killing appears presently to be 
sub-optimal, partly due to the difficulty of achieving sufficiently high levels of gene transfer to 
a sufficient number of tumour cells in vivo. The ultimate therapeutic potential of the system in 
treating cancer patients will depend upon a number of factors, including the efficiency with 
which the gene encoding the prodrug activating enzyme can be delivered to and expressed 
within the tumour cells to produce the enzyme and the selectivity of these processes for tumour 
cells versus the cells of normal, vital tissues or organs within the patient. Clinical efficacy will 
also depend upon a number of factors such as the intrinsic susceptibility of the tumour cells to 
the activated prodrug, and the capacity of host defence mechanisms such as the immune system 
to work co-operatively with the gene therapy to assist in killing tumour cells that are not 
directly killed by the activated prodrug. 

A number of other parameters also appear to be important. For example, the concentration of 
prodrug to which the cells are exposed is a major factor determining the rate at which the 
activated drug form can be generated to produce its toxic effect. Furthermore, the greater the 
duration of exposure of the cell population to the prodrug, the greater the amount of activated 
drug form which can be generated. In the body, the substrates for the enzymatic reaction are 
likely to be present in non-limiting amounts, (although this will of course depend upon the 
dose which has been administered, and the pharmacokinetics of its distribution within the 
body) so the rate at which the activated drug species can be generated, at any given local 
concentration of prodrug, will depend upon the amount of catalytically-active enzyme that is 
available in the cell. Perhaps most important is the efficiency of the enzyme; at a given 
concentration of substrate and enzyme, the reaction will proceed faster if the K M is lower for 
substrate, or if the ka, is greater, or both. 

Naturally, one would attempt to optimise the exposure (concentration x duration) of the tumour 
to the prodrug by selecting the most suitable route and schedule of delivery, however it is 
likely that the maximum achievable exposure would be limited by factors such as toxicity to 
the patient or solubility of the prodrug. However, there are in principle two routes to "improve" 
the enzyme. One involves site-directed mutagenesis. An alternative route requires no prior 
knowledge of the structure of the enzyme, but relies on the production of a large number of 
random mutants, and the use of one or more screening procedures to identify the rare, 
improved variants against the large background of indifferent or detrimental mutants. 
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Such strategies of random mutagenesis and selection have been shown to yield improved 
enzyme variants. For example, Black et ai introduced random sequence into a region of the 
thymidine kinase gene from herpes simplex virus, and screened more than one million 
independent clones by positive selection on plates containing 5-fluorodeoxyuridine, to identify 
426 which retained a threshold level of thymidine kinase activity. These were then screened 
individually to identify a subset that demonstrated improved characteristics with the prodrugs 
acyclovir and/or ganciclovir (Black et al, 1996). Another group applied the PCR-shuffling 
technique with alternating rounds of selection and PCR shuffling has resulted in a 32,000-fold 
enhancement of the activity of TEM-1 P-lactamase against the antibiotic cefotaxime (Stemmer, 
1 994); selection was accomplished by plating bacteria containing the plasmid library on agar 
containing the antibiotic. 

However, although such precedents provide encouragement that it is possible to make 
improved enzymes via such mutagenesis strategies, no efficient selection strategies have yet 
been described which could be generally applied to the improvement of any prodrug activating 
enzyme. Consequently, there remains a need for a system that allows the selection of variant 
prodrug activating enzymes that exhibit improved catalytic activity. 

Summary of the invention 

According to the present invention there is provided a method of selection of a nucleic acid 
encoding an enzyme that is capable of converting a prodrug to its active drug form comprising 
the steps of: contacting a population of bacteria transformed with a bacteriophage library with a 
prodrug in a medium, wherein; 

i) the transformed bacteria are in the lysogenic state, and 

ii) when converted to its active drug form ? the prodrug causes activation of the proteolytic 
activity of bacterial RecA and lysis of the bacteria; 

separating bacteriophage particles released by lysis of the bacteria from said medium, and 
analysing the genotype of said separated bacteriophage for a nucleic acid encoding the enzyme. 

By prodrug is meant any drug molecule that is itself relatively inert pharmaceutically but which 
has a pharmacological effect once activated in an organism by biological, biochemical, 
chemical or physical means. The enzyme encoded by the selected nucleic acid should be a 
prodrug activating enzyme that acts catalytically to convert prodrug to its active drug form. 
Alternatively, there may be more than one reaction step between the prodrug and the final 
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chemical species which is directly toxic; for example, the toxic 5-[aziridin-l-yl]-4- 
hydroxylamino-2-nitrobenzamide which can be produced by nitroreductase from CB1954 is 
believed to undergo further, non-enzymatic reaction to generate 4-(N-acetoxy)-5-[aziridin-l - 
yl]-2-nitrobenzamide, which is believed to be the chemical species which reacts with DNA to 
produce the toxic crosslinks (Knox, R. J., F. Friedlos, T. Marchbank, and J. J. Roberts. (1991), 
Biochem. Pharmacol, 42:1691-1697;. The toxic product cannot be generated in any significant 
quantity, without the presence of the activating enzyme. 

The method of the invention requires the use of a temperate bacteriophage and is based on the 
ability of such bacteriophage to propagate in bacteria in either of two modes. The first mode 
(the lytic phase) involves rapid, episomal replication of the bacteriophage leading to assembly 
of progeny virus particles and lysis of the infected bacteria cell. The other mode (the lysogenic 
phase) does not kill the host bacterium; instead, the bacteriophage genome becomes covalently 
integrated into the DNA of the bacterial chromosome, where it is replicated along with the host 
DNA, and is transmitted to the daughter cells. Once established, the lysogenic state is 
maintained by the cl repressor protein of the bacteriophage, which binds to sequences within 
the bacteriophage genome to promote its own synthesis, and repress transcription of the 
remainder of the bacteriophage genome. In this state, the bacterium is protected from further 
infection by other, similar bacteriophages, because the presence of the cl repressor in the cell 
also prevents the expression of genes required for lytic growth from incoming bacteriophage 
genomes. In this state, the bacterium is called a "lysogen^ and the integrated copy of the 
bacteriophage is called a prophage. Establishment of this state is called lysogeny (Hendrix, R. 
W., J. W. Roberts, F. W. Stahl, and R. A. Weisberg (1983) Lambda II). 

Certain treatments of lysogenic bacteria can lead to activation of a proteolytic activity of the 
bacterial RecA protein. This in turn can lead to cleavage and hence inactivation of the 
bacteriophage cl repressor protein, thus triggering lytic growth of the prophage. The 
inactivation of cl repressor allows expression of the bacteriophage genes required for lytic 
growth, including functions which excise the prophage genome from the bacterial 
chromosome, leading to its episomal replication, and the assembly of many copies of the 
bacteriophage into mature bacteriophage particles, which are then released upon lysis of the 
host cell. (Roberts, J. W., and R. Devoret. (1983) Lysogenic induction, in R. W. Hendrix, J. 
W. Roberts, F. W. Stahl and R. A. Weisberg (ed.). Lambda II; Cold Spring Harbor Laboratory 
Press). 
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In the temperate bacteriophage Lambda, prophage induction can be induced in this manner by 
agents that damage DNA (for example, ultraviolet light, ionising radiation, activated chemical 
carcinogens, certain anti-tumour drugs), as well as agents which disrupt DNA replication, 
including 5-fluorouraciL and nucleoside- or nucleotide-analogues. The mechanism is thought 
to be that any of these agents lead to the generation of single-stranded DNA in the cell, which 
is required, along with ATP or dATP and magnesium ions, to activate a specific protease 
activity of bacterial RecA. This normally leads, via cleavage of the LexA protein (a repressor 
of transcription), to the activation of a number of genes in the bacterium associated with DNA 
repair functions, allowing better survival of the DNA damage; this is called the SOS response. 

The principle of the invention is that a library of DNA molecules encoding a number of 
variants of prodrug activating enzymes is inserted into the genome of a bacteriophage. The 
bacteriophage vectors are then introduced into host bacteria, and lysogens are selected. In 
general, each bacterium contains only one bacteriophage vector type and the inserted gene is 
expressed within the lysogenic bacteria. Upon exposure of the population of lysogenic bacteria 
to prodrug, the toxic derivative of the prodrug will only be produced within bacterial cells 
containing a prodrug activating enzyme, and then at a rate which depends upon the catalytic 
activity of the prodrug activating enzyme that is present within that cell. The presence of the 
toxic product within the cell leads to the activation of the protease activity of the bacterial 
RecA protein, leading to cleavage of the cl repressor protein of the bacteriophage, and thus 
triggering lytic growth. The prophage in that particular cell is thus excised and replicated, 
leading to lysis of the host cell and the release of around 100 progeny bacteriophage particles 
into the medium, from which they can be readily recovered. 

At any given concentration of prodrug, the toxic product will be produced faster in bacteria 
containing a bacteriophage vector encoding a more efficient enzyme. Lytic growth of 
bacteriophage will therefore be triggered relatively early in those instances when the 
bacteriophage encodes a highly active prodrug activating enzyme. Lytic growth will be 
triggered relatively late (or not at all above the background level), if the encoded enzyme 
variant has little or no activity against the prodrug. Thus, at limiting exposure to prodrug (low 
concentration and/or short duration of exposure), the copies of the vector recovered are 
enriched in those encoding the most efficient variants of the prodrug activating enzyme. 

Any temperate bacteriophage may be used in the invention, as will be clear to those of skill in 
the art. The bacteriophage lambda (X) is the bacteriophage of choice. All genes and functions 
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of the temperate bacteriophage that are required for the initial infection, and establishment and 
maintenance of lysogeny of bacteria should be retained. Furthermore, all genes and functions 
required for the induction and completion of the lytic cycle upon activation of RecA protease 
activity should be present in the bacteriophage. 

As will be clear to the skilled artisan, other features of the bacteriophage are desirable. For 
example, the bacteriophage genome should contain a restriction site at which a DNA fragment 
(encoding a prodrug activating enzyme) can be inserted and expressed in the lysogenic state. 
The presence of suitably positioned regulatory DNA sequences to facilitate the expression of 
the inserted DNA in the bacterial lysogen is also advantageous, since it removes the need for 
the inserted DNA to contain its own regulator}' sequences required for expression of the 
encoded proteins. The insertion site could be designed so as to increase the efficiency of 
insertion of suitable DNA fragments, for example, by the use of suitable, asymmetrical sites to 
favour the insertion of the exogenous DNA in a preferred orientation, and to reduce the 
background level of non-productive ligation products. This would facilitate the generation of a 
library having a high level of diversity. The inclusion of a selectable marker such as an 
antibiotic-resistance gene within the bacteriophage vector would facilitate the separation of 
lysogens from bacteria not infected with the bacteriophage vector. The bacteriophage could 
also be designed to express its encoded protein only in the lysogenic state. As will be clear to 
the skilled artisan, this might be by using a regulated promoter allowing selective expression, 
for example, using the tac promoter and the inducer IPTG in the media. 

Methods for the introduction of genes or gene fragments into bacteriophage DNA and for the 
construction of bacteriophage libraries are known in the art, for example, see Sambrook et al, 
1989, Cold Spring Harbor Laboratory Press. Of course, it is not necessary for the entire gene for 
the prodrug activating enzyme to be cloned into the bacteriophage DNA - fragments may also be 
used. However, if a fragment is used, this should encode a protein fragment that retains catalytic 
activity in activating prodrug to its active drug form. 

The bacteriophage library should contain a library of genes. A library of genes may be used that 
is derived from a single organism, to select for a gene or genes which exhibit the desired catalytic 
activity, for example as a form of expression cloning to identify novel active proteins in a 
particular organism. However, in most instances the gene encoding the wild type enzyme of 
interest will already be available in the art - nucleic acid coding sequences for use in the 
invention are widely reported in the scientific literature and are also available in public 
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databases. Further, the DNA may be commercially available, may be part of known cDNA 
libraries or may be generated using standard molecular biology and/or chemistry procedures as 
will be clear to those of skill in the art. In such instances, the invention will be most useful to 
allow the selection of genes encoding improved variants of the wild type protein that exhibit 
improved catalytic activity. Accordingly, a gene library will preferably comprise a selection of 
mutated variants of the wild type gene. 

Many different approaches to random mutagenesis may be used to generate the library. For 
example, chemical modification can be used to alter bases in the gene sequence, leading to 
changes in the DNA sequence upon replication or repair. Examples of chemical modification 
include treatment of a single stranded form of the gene with bisulphite, which converts some of 
the cytosine residues to uracil residues. When the opposite strand of DNA is then made by a 
DNA polymerase, an adenine residue is inserted opposite the uracil nucleotide. In this way, a 
C-G base-pair is converted to a U-A base-pair, and, subsequently, to T-A (see Watson, J. D., 
M. Gilman, J. Witkowski, and M. Zoller, (1992) Recombinant DNA. 2 nd Edition; Scientific 
American Books; W. H. Freeman & Co. New York, and references therein). DNA polymerases 
with relatively high error rates can be used for replication of the gene e.g. the avian 
myeloblastosis reverse transcriptase can be used in this way (Singh, M., S. Heaphy, and M. J. 
Gait. ( 1986), Prot. Eng. 1:75-76; see also Abarzua, P and Marians, K J 1984; Proa Natl. Acad 
Sci. USA 81, 2030-2034), as can the Thermus aquations (Taq) DNA polymerase, which may 
be used repeatedly as in the polymerase chain reaction (Leung, D W ? Chen, E and Goeddel, D 
V (1989); Technique 1 ; 1 1-1 5). Another mutagenesis strategy has been described by Stemmer 
(Stemmer, 1994); in this method, the gene is randomly fragmented and then reassembled by a 
process similar to the polymerase chain reaction, with an associated introduction of mutations. 
An advantage of this particular strategy lies in the combination of alternating rounds of 
selection, and the mutagenic "PCR-shuffling ? procedure, which can recombine beneficial 
mutations from different gene variants into a single gene, and facilitate the separation of 
deleterious changes from those that are beneficial. 

An approach intermediate between that of site-directed mutagenesis and completely random 
mutagenesis throughout the gene might be appropriate where some information has indicated 
that particular regions of the enzyme are likely to be critically involved in the reaction of 
interest, but it has not yet proven feasible to predict precisely how the enzyme may be 
improved. In such a situation, it may be appropriate to produce a library of mutants in which 
the changes are localised to that critical region of the protein. This could be achieved using 
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recombinant DNA techniques to replace the relevant region of the gene with a diversity of 
nucleic acid molecules that are completely random in sequence (with or without variation in 
length) to make a library of gene sequences. Clearly, the majority of mutations generated by 
randomised approaches are unlikely to result in improvement of the enzyme, and many 
5 changes are likely to be detrimental. However, the method of the invention allows the selection 
of beneficial variants out of a large negative background, meaning that the proportion of 
detrimental mutation events is largely irrelevant. 

A further aspect of the invention may involve a PCR-shuffling and selection process with a 
family of related genes, rather than using a single gene. For example, in the case of 
10 nitroreductase, genes from various different bacteria (e.g. species of Salmonella. Enterobacter, 
Helicobacter or Vibrio) might be used in the selection process. This procedure would facilitate 
sampling of a larger volume of "sequence-space" giving, greater diversity in the library, and 

•%Q hence more efficient generation of improved derivatives (Crameri et ai., 1998). 

m 



It is envisaged that there could be synergy between the "random" approach, and the structure- 
y 15 guided, site-directed mutagenesis approach. Structural analysis of improved variants would 
help evaluate how the enzyme works, and structure-guided, site-directed mutants could be 
optimised by random shuffling mutagenesis and selection. 

The bacteriophage library will generally comprise around 10 6 variants, as is typical with 
lambda libraries, although either smaller or larger libraries could be used. After introduction of 
20 the gene library into bacteriophage, the bacteriophage library is propagated in bacteria and may 
be stored for subsequent use (see Sambrook et al. 1989). 

Choice of host bacteria is also of importance. The host bacterium must support the lytic and 
lysogenic growth of bacteriophage vector and must retain the ability of the wild-type RecA 
protein to activate lysogens into lytic growth upon exposure to agents which damage DNA or 
25 otherwise disrupt DNA synthesis in the bacterium. Many bacterial strains are suitable for the 
method of the invention, although E. coli is preferred. The production of lysogens will be 
facilitated if the bacterium has a mutation such as hflA~ (high frequency of lysogeny), which 
favours the establishment of lysogeny. Without this host mutation, most bacteriophage would 
first grow lyrically, and lysogenised bacteria would only emerge against a background of lysis. 
30 The presence of an antibiotic-resistance gene in the bacteriophage is thus useful to assist the 
selection of lysogenised bacteria, as those bacteria which were not infected and thus cannot 
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become lysogenised will be inhibited or killed by the antibiotic. Other forms of selection are 
also possible,for example s based on complementation of metabolic/nutritional deficiencies of 
an appropriate host strain. 

In general, each lysogenised bacterium will contain only a single bacteriophage genome, since 
5 the initial infection will be performed with an excess of bacteria relative to infectious 
bacteriophage particles; and once lysogeny is established, the cl repressor in the cell prevents 
expression of lytic cycle genes and of the genes required for integration, from the genomes of 
infecting bacteriophages. 

A further advantageous feature of the bacterial host strain, to facilitate the selection procedure, 
is that the bacteria should be negative for the prodrug activating enzyme. For example, many 
strains of E. coli contain an endogenous, functional nitroreductase gene. A suitable deletant 
strain may be publicly or commercially available. If not, the generation of a strain in which the 
enzyme is deleted will be within the normal skills of those practised in the art. Preferably, the 
strain used should lack a functional nitroreductase. 

The bacterial strain E. coli C600H/1 (Promega cat. No. D3131) contains a functional 
nitroreductase gene. This gene could be mutated, and deletants selected for. Other methods of 
mutation will be clear to the skilled artisan, for example, by first introducing a mutation into a 
cloned copy of the nitroreductase gene, such as by: insertion of the omega-(Q)-interposon 
described by Prentki, P and Krischu H.M., [1984], Gene 29;303-313 or in vitro insertional 
mutagenesis with a selectable DNA fragment in which a streptomycin/spectinomycin- 
resistance gene is flanked by translational and transcriptional termination signals. The mutated 
gene could then be introduced into the bacteria (without any attached replication origin), and 
the bacteria selected with streptomycin or spectinomycin; thus one would select for bacteria in 
which the mutated gene had recombined into the host genome, and these could then be checked 
to confirm the inactivation of the endogenous nitroreductase gene. 

In the method of the invention, the bacteriophage library must be introduced into host bacteria 
and the lysogenic state established. Under growth conditions the bacteria multiply, replicating 
their own DNA and that of the bacteriophage, so that copies of the lysogenic bacteriophage are 
transmitted to daughter cells. 

30 Under conditions of environmental stress, and in particular in response to DNA damage, the 
SOS pathway is activated in the bacterium, activating machinery required for repair of the 
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DN A damage. The inducing signal is also a signal for reactivation of lysogenic bacteriophage. 
Specifically, the RecA protease of the bacterium cleaves the cl repressor of the bacteriophage, 
allowing lytic cycle genes to be expressed. The bacteriophage genome is excised from the 
bacterial genome, replicated, and assembled into progeny bacteriophage particles, which . are 
released by lysis of the host cell. Drugs that act on DNA, causing single-stranded lesions, 
crosslinking or breakage of the DNA strands, or drugs which act in other ways to inhibit the 
process of DNA replication, can all lead to the generation of an excess of single-stranded DNA 
in the cell and hence activate this pathway. Accordingly, those cells containing active prodrug 
activating enzyme will convert more prodrug to its active drug form and will consequently lyse 
earlier than cells containing a less efficient enzyme. 

There will be a background of spontaneous bacteriophage release, which will contaminate the 
"selected" bacteriophage population. As will be clear to the skilled artisan, the relative 
numbers of background and selected bacteriophage can be managed by appropriate choice of 
prodrug concentration and duration of exposure (which will determine the fraction of the 
culture in which lysis is induced). 

Preferably, the bacteriophage selected can be used immediately to generate more lysogens for 
further iterative rounds of selection; each round of selection will reduce the fraction of 
irrelevant bacteriophage in the "background". The selected variant genes could then be 
isolated,for example, by appropriate restriction endonuclease digestion or by PCR and used for 
further rounds of shuffling (to allow recombination of beneficial mutations from different 
bacteriophages) and mutagenesis, then inserted back into the bacteriophage vector and the 
selection process repeated. In this fashion, progressive improvement of the selected gene 
populations results. Preferably, at least two, more preferably three or more rounds of shuffling 
followed by selection are used. 

After each round of the selection, some of the variant genes may be analysed directly (for 
example, by first piaqueing the phage to isolate single clones, then sequencing; by prodrug 
activation-mediated cytotoxicity in bacteria, or by enzymological analysis after expression in 
bacteria). The most promising of the variants can be used directly for antibody-directed 
enzyme prodrug therapy (ADEPT) or (after optional modification of codon usage) for 
gene/virus directed enzyme prodrug therapy (G/VDEPT) applications. It is also possible to 
"backcross" the optimised constructs with the original nitroreductase gene during a round of 
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shuffling and selection, to identify which amino acid substitutions are truly beneficial and 
which are "passenger mutations" of little consequence for the functionality of the enzyme. 

The method of the invention is suitable for any combination of activating enzyme and prodrug, 
provided that prodrug activation results in activation of the proteolytic activity of the bacterial 
RecA protein. Nitroreductase is one example of such an enzyme and CB1954 is an example of 
a suitable prodrug substrate. CB1954 is a relatively poor substrate for wild type nitroreductase: 
the turnover rate k ca t is just 360 min" 1 and the Michaelis constant K M is high at 860 (9). 
This enzyme is thus an ideal choice for the selection method of the present invention. Other 
enzymes which might be suitable for optimisation for their prodrugs include thymidine kinase, 
cytosine deaminase or purine nucleoside phosphorylase. The mammalian DT-diaphorase 
enzymes, which can also activate CB1954 though less efficiently than nitroreductase, could 
also be optimised for this purpose. Other examples will be clear to those of skill in the art. 

The choice of prodrug will depend on the system being studied, and will clearly be a substrate 
for the prodrug activating enzyme of interest. In the case of thymidine kinase, the prodrugs 
acyclovir and/or ganciclovir will be suitable. For cytosine deaminase, 5-fluorocytosine is 
converted to the active form 5-fluorouracil. For purine nucleoside phosphorylase, the product 
of the E, coli DeoD gene, 6-methyl-9-(2-deoxy-P-D-erythro-pentofuranosyl) purine (6-MPDR, 
a deoxyadenosine analogue) is an example of a suitable prodrug. Prodrug substrates other than 
CB1954 have been described for nitroreductase (Anlezark et aL 7 1995; Friedlos el al y 1997). 
Some of these prodrugs have certain advantages over CB1954, for example, SN 23862 (the 
bischloroethyl analogue of CB1954) is not activated by mammalian DT diaphorases, and so 
may have lower toxicity. 2-[N,N-bis(2-iodoethyl)amino]-3,5-dinitrobenzamide, which has 
shown promising properties in vitro, could be applied to the method of the present invention. 
Similarly, other enzymes could be tested with the same prodrugs, for example, rat DT 
diaphorase has the advantage of selectively reducing CB1954 at the 4 position, although the 
k cat is rather lower than for nitroreductase. Nitroreductases from bacteria other than E. coli 
provide alternative starting points. Thus, one could in principle start with several different 
nitroreductase genes from different families to facilitate sampling of a larger volume of 
vi sequence-space'XCrameri etai, 1998). 

The concentration of prodrug to be used in the method of the invention is selected so as to 
facilitate the selection process and will thus depend on factors such as the relative proportion of 
the library which is thought to harbour active enzyme, the size of the library and the 
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approximate activity of the optimal enzyme. For example, at low concentrations of prodrug, 
only lysogens containing the most efficient enzymes should within a reasonable time convert 
adequate prodrug to generate DNA damage sufficient to activate RecA protease activity and 
hence induce lytic .growth of the bacteriophage in that bacterium. Thus, at limiting 
concentration of prodrug, only the bacteriophage containing the "best" prodrug activating 
enzyme genes are activated into the lytic pathway. This concentration of prodrug thus allows 
recovery of variant genes that made their bacterial hosts most susceptible to prodrug. 

In the case of nitroreductase and CB1954, a suitable concentration of prodrug is aroundlO- 
100nM. For acyclovir, gancyclovir, 5-fluorocytosine etc, a suitable concentration is around 0.1 
- 1 fig/ml. 

Suitable media in wh\ch to carry out the method of the present invention will be clear to those 
of skill in the art. For Lcample, for E. colU Luiria-Bertani (LB) broth is suitable. In addition to 
prodrug, the media m^y contain other necessary additives, for example, antibiotic or an 
inducer, for example, IPTG. 

Bacteriophage particles released by lysis of host bacteria may easily be separated from non- 
lysed bacteria. For example, treatment of media samples with chloroform (-1-10%) is 
sufficient to kill any bacteria. The lysate can then be centrifuged briefly and the supernatant 
will contain released bacteriophage that can be used to propagate in bacteria and to select 
further for bacteriophage containing genes of interest. 

According to a further aspect of the invention there is provided a catalytic enzyme or fragment 
thereof, isolated according to a method described above. Preferably, the catalytic enzyme will 
comprise an improved variant of wild type nitroreductase, thymidine kinase, cytosine 
deaminase or purine nucleotide phosphorylase. The amino acid sequence of the protein or 
protein fragment will differ from the wild type sequence of the protein or protein fragment, by 
amino acid insertion, deletion or substitution. 

According to a still further aspect of the invention there is provided a nucleic acid molecule 
encoding a catalytic enzyme selected according to the method of the invention. Such a nucleic 
acid molecule may be obtained by sequence analysis of the genome of bacteriophage selected 
for containing an enzyme variant of improved catalytic activity in the particular system studied. 
It is envisaged that information obtained by such sequencing will be useful to allow analysis of 
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the structure of the protein of interest and will allow the further improvement of the protein 
activity by rational or random mutagenic strategies. 

The invention will now be described by way of example, with particular emphasis on the 
enzyme nitroreductase and prodrug CB1954. It will be appreciated that modification of detail 
may be made without departing from the scope of the invention. 

All documents mentioned in the text are incorporated herein by reference. 
BRIEF DESCRIPTION OF THE FIGURES 

JigureHs a graph showing the induction of phage release from 5:1 mixture of lysogens JG3J1 
(control) and JG16Cl(ntr) by 0.5mM CB1954. 

_Figure2 shows the type of phage released from lysogen mixture, as determined by PCR. 
_Figure 3 shows the amount of nitroreductase when fully induced in 5UG3J1 lysogens. 
EXAMPLES 

1. Construction of a suitable vector 

Information relating to recombinant DNA technology techniques can be found in (Sambrook et 
al, 1989). In addition, information of the biology of the bacteriophage lambda can be found in 
(Hendrix et al, 1983). 

Preparation of a promoter cassette. 

A DNA fragment containing the tac Promoter was obtained from plasmid pDR540 
(Pharmacia), by PCR amplification using pYimers 5'- ACCTGACGTCTAAGAAAC -3' and 
5'- GCTCTAGATTGTTATCCGCTCAC -3'.Vhe amplified DNA fragment was cleaved with 
restriction endonucleases £coRI and Xbal, an\ the major fragment (369 bp) was cloned 
between the EcoRl and Xbal sites of the widel\used vector plasmid, P UC19, to generate 
plasmid pPSl 133C2. The sequence of the insert wL verified by DNA sequencing (Sanger & 
Coulson, 1977). 

The tac promoter is a composite promoter, made from parts of the trp promoter and the lac 
promoter; it allows a high level of expression, which can be repressed by the lac repressor; 
thus, a high level of expression can be induced in the presence of inducers such as IPTG. 
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Insertion of paired Sfil sites, to facilitate directional cloning of suitable inserts. 

The two oligonucleotides P^1133A (5'- CTAGGGCCTGCGAGGCCTTAATTAA- 
GGCCTCCCGGGCCT -3*) and\ PS1 133B (5'- CTAGAGGCCCGGGAGGCCTTAA- 
TTAAGGCCTCGCAGGCC -3') were annealed together to generate a short piece of DNA 
containing two Sfil sites separated by aVad site, and with 4 nucleotide 5 ? extensions on either 
end compatible with ligation into Xbal sites. However, only that at the right end regenerates the 
Xbal site: 

Sfil * Pad Sfil 

PS 113 3 A5 ' CTAG GGCC T GCG A GGCC TTAATTA AGGCC TCCCGGGCCT 3 
PS113 3B 3' CCGGACGCTCCGGAATTAATTCCGGAGGGCCCGGAGATC 5' 



This was ligated into the Xbal site of plasmid pPS1133C2. A derived plasmid clone named 
pPSl 133L10 was identified, in which a single copy of the 39bp insert was present within the 
plasmid, in the orientation which regenerated XhcXbal site downstream of the insert (i.e. the tac 
promoter, and the Xbal site are on opposite sides of the inserted 39 bp sequence). 

Production of a plasmid with paired Sfil sites 

The plasmid pETllc (Nbvagen) was cleaved with Xbal, and the following annealed 
oligonucleotides were cloneainto that site: 

PS1134A 5 CTAGAGGCCTOCGAGGC 3 



PS1134B 3 TCCGGACGSTCCGGATC 5 



The resultant plasmid pPS113 v 4D4 was identified with the single insert in the intended 
orientation, which regenerated thecal site on the upstream side of the insert (i.e. nearer to the 
romoter). 

Plasmid pPS1134D4 was cleaved with Barhtil, and the following annealed oligonucleotides 
were cloned into that site: 
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PS1134C 5' GATCCGGCCTCCCGGGCC 3' 
PS1134D 3* GCCGGAGGGCCCGGCTAG 5' 

The resultant plasmid pPS1134S5 has a single copy of the insert in the orientation which 
regenerates the BamHl site upstream (towards the promoter). 

Thus, the coding region of genes can be inserted between an Ndel (at the initiation codon) and 
a BamHl site in this vector. The vector then provides a "Shine-Dalgarno" sequence an 
appropriate distance upstream of the start codon, to facilitate binding of ribosomes to the RNA 
transcribed from the gene, and hence efficient translation to produce the encoded protein. This 
is flanked on either side by the paired Sfil sites, allowing intervening fragment to be cloned 
directionally into the bacteriophage vector to be described. The T7 promoter present upstream 
of this cassette in pPSl 134S5 also allows efficient expression of the protein from this plasmid, 
in a suitable host strain such as E. coli BL21 [>.DE3]. 

Insertion of the E. coli nitroreductase gene into pPSl 134S5 

DNA sequence derived fronAhe nitroreductase gene infnB) of E. coli strain DH5a was 
amplified by the polymerase chaih reaction from genomic DNA purified from that strain, using 
primers \ 

PS 1 1 38A 5'- GGGAATTCCATaTGGATATCATTTCTGTCGCCTTAAAGC -3' and 
PS1 138B 5'- CGCGGATCCTGAGAQGAAATAGCCGGGCAGATGC -3'. 

The amplified DNA was digested with the enzymes BamHl and Ndel, and the main fragment 
(-700 bp) was cloned between the Ndel and BamHl sites of pPSl 134S5, to produce pJG16Cl; 
the sequence of the inserted fragment containing the nitroreductase coding sequence was 
verified by DNA sequencing. 

Insertion of the tac promoter cassette into lambda A.NM1 151 

The lambda vector XNM1151 (Murray, 1983) appeared to meet most of the essential criteria, 
although the immunity region of this bacteriophage is derived from bacteriophage 21, and is 
reported as imm^d 15 , i.e. a temperature-sensitive mutant of the repressor. The bacteriophage 
21 is closely related to lambda, however it was not clear whether the induction from lysogeny 
mediated by the imm 2, dts would be as efficient as that with the canonical, imm\:I + . This 
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XNM1151 vector had unique restriction enzyme sites for EcoRl, HindiU and BamHl. It was 
therefore decided to produce the vector based on this bacteriophage, however it would be 
possible, if the imm 2l cl ts proved less than ideal, to carry out a genetic cross with wild-type X to 
swap the imm 21 cl ls for imm'Tf. 

A sample of XNM1151 bacteriophage was provided by Prof. N. Murray (University of 
Edinburgh). The tac promoter cassette, including the pair of Sfii sites, was obtained as a 
HindiU fragment from pPSl 133L10, and cloned into the HindiU site of A.NM1 151 to generate 
^PS1141A5. 

Insertion of a kanamycin-resistance cassette into XPS1 141 AS. 

A Nhel-BamHl fragment encoding the aminoglycoside ^'-phosphotransferase gene 
(kanamycin resistance) was isolated from pACYC177 (New England Biolabs). The ends were 
made blunt by treatment with Pfu DNA polymerase (Stratagene), and the fragment was cloned 
into pHP45Q which had been digested with Smal, allowing the kanamycin resistance gene to 
replace the "interposon" in that vector (Prentki, P., and H. M. Krisch. 1984) . This step allowed 
the kanamycin resistance gene to be subsequently isolated as an EcoJU fragment, which was 
inserted into the single £coRI site of *.PS1 141 AS, to generate kJG3]\. 

Insertion of the £. coli nitroreductase gene into the expression site of A.JG3J1. 

The nitroreductase gene of £. coli strain DHScc was excised from plasmid pJG16Cl by 
digestion with 5)71, and inserted between the two Sfil sites of ?JG3J1. The asymmetry of the 
S/Il sites ensured that the nitroreductase gene could only be inserted in the orientation required 
to allow its expression from the tac promoter. The resulting bacteriophage was called 
AJG16C1. 

2. Recovery of catalytic bacteriophage 
AIM: 

To test whether the vector >JG16C1 (encoding a functional, wild-type nitroreductase) could be 
preferentially recovered from a mixture with XJG3J1 (lacking a nitroreductase insert), by 
treatment of the mixed population of lysogens with CB1954. 



Method: 
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Exoli C600///7 (obtained from Promega Cat No. D3131) were infected with >JG3J1 or 
*JG16C1 for 15min at 37°C and spread onto kanamycin (30|ig/ml) selective Lennox agar (LA) 
plates to select for lysogens. A single colony of each was picked, and the lysogenic bacteria 
were then grown aerobically at 37°C to mid log phase in LB containing 0.1 mM IPTG. 
5 Approximately 4.8 x 10 8 AJG3J1 lysogens and 9 x 10 7 X JG16C1 lysogens (as estimated from 
the optical density; approximate 5:1 ratio withoutiwith nitroreductase) were mixed in 5ml LB 
containing 0.1 mM IPTG in the presence or absence of 500 uM CB1954 (provided by Cobra 
Therapeutics and diluted from a 50mM stock solution in 2:7 N-methyl pyrroIidinone:PEG300). 
The mixtures of bacterial lysogens were incubated at 37°C with shaking and 50u.l samples 
10 removed at intervals between 15 and 185 minutes after mixing. 

Each sample was immediately mixed with 1ml LB and lOOul chloroform to kill any bacteria, 
briefly centrifuged to pellet debris, and the supernatant diluted 1 in 10 before using 10 |il to 
i|f infect E. coli C60QHJ1 and plating onto LA containing kanamycin, to select for bacteria 

lysogenised by bacteriophage in the sample. 

The number of colonies growing from the samples was counted the following day. and 
indicates the total number of bacteriophage present in the sample. 

The polymerase chain reaction (PCR) vias used to determine whether the bacteria in each 
colony contained XJG3J1 or XJG 1 6C 1 . A.raimber of individual colonies were picked at random 
from the most relevant plates and transferred to a 200ul PCR tube, lysed in a microwave for 
2min and 35ul of PCR reaction mix [16mM (NH 4 )2S0 4 , 67mM Tris-Cl (pH 8.8 at 25°C), 
0.01% Tween-20 (Bioline), 0.2mM each dNTY, 1.5mM MgCl2, lu Biotaq (Bioline)] plus 0.25 
uM primers JG2A 5'-TGGCGGAAAGGTATGCATGC and; 

JG2B 5 , -CAGAGCATTAGCGCAkGGTG, 

which anneal to X sequences flanking the Hindlll < he, was added. 

25 A tube containing reaction mix but no bacteria was used as a control to check for 
contamination of PCR reaction components. After 30 cycles (94°C/5min; then 30x 94°C/45s, 
55°C/45s, 72°C/160s; then 72°C/10min) the PCR products were analysed by gel electrophoresis 
to determine the product size. Lysogens containing >JG16C1 expressing ntr gave a 1.1 kb 
product whereas XJG3J1 lysogens gave a 0.4kb product. All the colonies gave one or the other 
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of these two bands but not both, and the negative control confirmed the absence of 
contamination in the PCR. 

Results: 

The data are shown in the attached Table 1 and Figures. 

The samples from the culture exposed to CB1954 showed an increase in colony number from 3 
to -360 over the timecourse, corresponding to an increase in the concentration of total 
bacteriophages released into the medium which increased from 6 x 10 5 per ml at early times, to 
7.2x 10 7 per ml at 155 minutes of the timecourse (see Table 1). In the parallel culture not 
treated with CB1954, the maximum concentration of bacteriophage released was 8 x 10 6 per 
ml, at 90 minutes. 

PCR analysis of randomly selected colonies at 90 and 125 minutes of the no-prodrug 
timecourse, indicated that 1/10 and 0/10 of these contained the XJG16C) bacteriophage, and 
the remaining 19/20 contained the >JG3J1 bacteriophage. In contrast, from the culture exposed 
to CB1954, a far greater proportion of the AJG16C1 bacteriophage (containing the 
nitroreductase gene) was recovered overall. At 60 minutes, when similar numbers of 
bacteriophage were recovered ±CB1954, only 1 of 1 1 colonies screened contained the 
7JG\6Cl bacteriophage. However the proportion of AJG16C1 increased rapidly to a peak of 
60% (18/30) at 90 minutes, declining thereafter. The peak concentration of the ^JG16C1 was 
reached at 125 minutes (3 x 10 7 per ml) [see Figure 1]. 

Discussion 

The C600H/I strain of E. coli contains an endogenous nitroreductase gene. Thus, all the 
bacteria used in this experiment contain some nitroreductase. This can be visualised as a faint 
protein band on an SDS-polyacrylamide gel analysis of total bacterial protein and stained with 
Coomassie blue, and also by blotting such a gel and visualising the nitroreductase selectively, 
by use of a suitable antibody (data not shown). This has been done for lysogens containing 
either XJG3J1 or XJG16C1, by collecting samples as a timecourse following induction with 
IPTG. It could thus be estimated that the XJG 16C1 lysogen contained around 5-10-foid more 
nitroreductase when fully induced, than either control cells, or the >JG3J1 lysogen (see Figure 
3). The presence of the endogenous nitroreductase in the >JG3J1 lysogen would be expected to 
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lead to some activation of CBI954; however the 5-10-fold greater level of the enzyme present 
in the AJG16C3 lysogen would be expected to lead to a faster rate of CB1954 activation. 

The greater number of total bacteriophage released from 60 minutes onwards in the CB1954- 
treated culture, and the preferential recovery of AJG16C1, provides a strong confirmation that 
exposure to CB1954 is indeed able to trigger the inactivation of the cl repressor leading to lytic 
bacteriophage growth; and that this is as a consequence of its activation by nitroreductase. 

The background level of bacteriophage released into the medium in the culture without 
CB1954 is expected, as it is known that there can be spontaneous lytic growth from lysogens, 
without obvious external stimuli. For the first -60 minutes of exposure to CB1954, there was 
no increase in the level of bacteriophage above background. Three factors will contribute to 
this delay; firstly, the amount of CB1954 which is activated and the resulting DNA adducts in 
the cell will increase with duration of exposure. Second, following activation of the RecA 
protease, it takes a finite time for this to deplete the level of d sufficiently to trigger lytic 
bacteriophage growth (perhaps 30 minutes, as reported for induction by ultraviolet light). 
Third, the duration of the lytic cycle is approximately one hour. The clearly demonstrated 
preferential recovery of XJG16C1 at times from 75 to 125 minutes can therefore be explained 
by the faster activation of CB1954 in these cells, leading to earlier triggering of lysis in the 
cells harbouring XJG16C1 than in those harbouring AJG3J1. 

Note that a ''bystander effect'" has been described in eukaryotic cells, whereby activated 
prodrug produced in one cell can diffuse and exert an effect upon neighbouring cells that may 
lack the activating enzyme. It is expected that a similar effect could operate in bacteria. In 
order to minimise this effect, the experiment was performed at a fairly low initial cell density. 
It is possible that at later times, a bystander effect has contributed to the induction of lysis in 
cells with the /JG3J1 prophage. However this experiment has clearly demonstrated that any 
bystander effect which may be present does not prevent the performance of the selection as 
described. 

This experiment has utilised a vector containing a wild-type nitroreductase gene, in comparison 
with one lacking a nitroreductase gene, to model the possible distinction that may exist in a 
library between more- and less-functional nitroreductase genes. In this preliminary experiment, 
both lysogens contained a background level of nitroreductase contributed by the endogenous 
gene. Based on the estimate of relative levels of the nitroreductase protein present in two 



WO 00/43541 



PCT/G B00/001 57 



20 

iysogens in the presence of IPTG, we can deduce that the ability of the XJG16C1 lysogen to 
activate CB1954 would be about 5-10 fold greater than that of the AJG3J1 lysogen. At the 
time of mixing, the >JG16C1 formed 16% of the mixture, whereas at the 90 minute time point. 
60% of the recovered bacteriophage were 2JG16C1, representing a -4-fold enrichment. 

It is predicted that there will be several ways in which this degree of enrichment can be 
increased. 

1) Perform multiple rounds of selection, using the selected bacteriophage from the first round 
as the input to the second round. It would be expected that a similar degree of enrichment 
would be obtained, resulting in an enrichment of about 16-fold overall of the 2JG16C1 
bacteriophage after 2 rounds. 64-fold after 3 rounds, etc. 

2) To the extent that the AJG3J1 Iysogens were being induced by the endogenous 
nitroreductase present within those cells rather than a "bystander effect", it would be possible 
to decrease the amount of ^JG3J1 recovered by using a strain of bacteria in which the 
endogenous nitroreductase gene had been deleted or inactivated. For example, one could use 
the E. coli strain UT5600 (available from the E. coli Genetic Stock Center, at Yale). This strain 
does not carry the Hjl mutation, and so most of the bacteria initially infected with the 
bacteriophage would be lysed by the bacteriophage, with release of the progeny bacteriophage. 
However a proportion of the cells would be lysogenised. and these could be selected either by 
their survival once the remaining cells had succumbed to lysis, or they could be selected based 
on the acquisition of kanamycin resistance, or both. Alternatively, it would be possible to 
inactivate the nitroreductase gene within the C600H/I strain, by a variety of alternative 
methods. 

3) As mentioned earlier, it was not clear at the outset whether Iysogens with the imm^cl 15 
would be inducible by DNA damaging agents in the cell. This experiment has shown that it 
does function adequately in this regard, however it remains possible that an alternative 
immunity region, such as that of the wild type lambda bacteriophage imn/cl~\ may allow a 
greater discrimination. 

4) The lytic cycle of lambda takes about one hour to complete, and cleavage of the cl repressor 
following UV light (and so, perhaps, also in response to the DNA lesions cause by activated 
CB1954) takes about 30 minutes; thus in this experiment it would appear that activating levels 
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of DNA lesions must have been produced within a few minutes of exposure to CB1954, in the 
XJG16C1 lysogen, and (5-10-fold slower,) within 30 minutes to I hour in the >JG3J1 lysogen. 
A greater temporal separation between the release of the two types of bacteriophages would be 
expected if a lower concentration of CB 1954 were used, so that the time to induction of the 
two lysogens would be increased proportionately, giving e.g. a 30 minute delay to the 
XJG16C1 lysogen and a 2.5-5 hour delay to induction of the AJG3J1 lysogen. The duration of 
the lytic cycle would be unaffected, and thus there would be less overlap between the release of 
the two types of bacteriophage. 

5) Although it appears unnecessary to invoke significant diffusion of activated prodrug 
between cells as contributing the induction, it would be possible to further reduce any possible 
contribution of this by a number of approaches: performing the experiment at a lower cell 
density; inclusion of substances in the medium to reduce spread of activated prodrug (e.g. 
serum proteins); or by performing the induction on a solid or semi-solid substrate, e.g. agar 
plates, such that any activated prodrug released by a cell would only diffuse a short distance, 
rather than being spread throughout the culture as in a shaking liquid culture. 

The principles established by this experiment indicate that the selection method and vector will 
also be effective when applied to a library of nitroreductase variants. In doing so, this 
experiment has shown that it would be desirable to perform the induction with lower 
concentrations of CB 1954. (e.g. IOOuM or lower), and to collect the released bacteriophages at 
intervals. The number of bacteriophage particles released would be monitored and compared 
with parallel cultures lacking CB1954; this would inform the decision which time-point sample 
to choose for further rounds of selection. One would perform several rounds of selection (e.g. 
3-5) in order to reduce the level of "background' 7 bacteriophages present; and then recover the 
inserts from the bacteriophages, e.g. by PCR, to enter them in further rounds of PCR-shuffling. 
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